The Biotechnology Systems Branch of the Scientific and Technical Information
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: Qt 

Date Processed by STIC: l/ZfofenDV 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202

Revised 01/29/2002 



Raw Sequence Listing Error Summary



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: m 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1   Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
    was retrieved in a word processor after creating it. Please adjust your right margin to J; this will
    prevent "wrapping."

2   Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.

3   Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters instead.

4   Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
    ensure your subsequent submission is saved in ASCII text.

5   Variable Length
    Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be missing.

6   PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s). Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
    the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.

7   Skipped Sequences (OLD RULES)
    Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
    Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

    <210> sequence id number
    <400> sequence id number
    000

9   Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's has been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In the <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10  Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
    scientific name (Genus/species). The <220>-<223> section is required when the <213> response is Unknown or
    is Artificial Sequence.

11  Use of <220>
    Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if the <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in the <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
    Please do not use the "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
    resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
    listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13  Misuse of n
    n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
    any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - OS/2 1/2001 
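
Several of the format errors in the summary above (invalid line length, tab codes in the numbering, non-ASCII text) can be screened for before a file is submitted. The following is a minimal sketch in Python, not part of the USPTO Checker program; the function name `check_listing_lines` and the wording of its messages are assumptions made for illustration. It flags lines longer than 72 characters (white space included), tab characters, and non-ASCII characters.

```python
MAX_LINE_LENGTH = 72  # limit stated in the error summary; white space counts


def check_listing_lines(text):
    """Return (line_number, message) pairs for basic format problems.

    Hypothetical helper for illustration only; line numbers start at 1.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append(
                (number, f"line is {len(line)} characters long (limit {MAX_LINE_LENGTH})")
            )
        if "\t" in line:
            # The summary asks for space characters, not tab codes.
            problems.append((number, "tab character found; use spaces"))
        if not line.isascii():
            problems.append((number, "non-ASCII character found"))
    return problems
```

A file read with `open(path, encoding="latin-1").read()` can be passed in directly; an empty result means none of these three problems was found, and says nothing about the other rules.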
RAW SEQUENCE LISTING DATE: 02/26/2002
PATENT APPLICATION: US/10/032,260 TIME: 11:05:42

Input Set : A:\EP.txt

Output Set: N:\CRF3\02262002\J032260.raw

Does Not Comply
Corrected Diskette Needed


3 <110> APPLICANT: de Belle, Ian 

4 Adamson, Eileen 

5 Mercola, Dan 

7 <120> TITLE OF INVENTION: Isolation and Identification of Control Sequences and 

8 Genes Modulated by Transcription Factors 
10 <130> FILE REFERENCE: PS-00101.P.1 

12 <140> CURRENT APPLICATION NUMBER: US/10/032,260

13 <141> CURRENT FILING DATE: 2001-12-20
15 <160> NUMBER OF SEQ ID NOS: 27

17 <170> SOFTWARE: PatentIn Ver. 2.0



ERRORED SEQUENCES 

35 <210> SEQ ID NO: 3 

36 <211> LENGTH: 20 

37 <212> TYPE: PRT 

38 <213> ORGANISM: Homo sapiens
40 <400> SEQUENCE: 3

41 Cys Asp Asn Phe Ser Ala Tyr Gly Trp Cys Pro Leu Gly Pro Gln Cys

E--> 42 1 5 10 15

44 Pro Gln Ser His

45 20

47 <210> SEQ ID NO: 4

48 <211> LENGTH: 31

49 <212> TYPE: PRT

50 <213> ORGANISM: Homo sapiens

52 <400> SEQUENCE: 4

53 Ile Ile Asp Thr Asp Glu Ala Ala Ala Glu Asp Lys Arg Arg Arg Arg
E--> 54 1 5 10 15

56 Arg Arg Arg Glu Lys Arg Lys Arg Ala Leu Leu Asn Leu Pro Gly 

E--> 57 20 25 30 

59 <210> SEQ ID NO: 5 

60 <211> LENGTH: 13 

61 <212> TYPE: PRT 

62 <213> ORGANISM: Homo sapiens 

64 <400> SEQUENCE: 5 

65 His Arg Ala Gly Phe Asp Ala Phe Met Thr Gly Tyr Val 
E--> 66 1 5 10

221 <210> SEQ ID NO: 16

222 <211> LENGTH: 234

223 <212> TYPE: PRT

224 <213> ORGANISM: Homo sapiens
226 <400> SEQUENCE: 16
227 Met Arg Asp His Ile Asp Tyr Arg Cys Cys Leu Pro Pro Ala Thr His
E--> 228 1 5 10 15
229 Arg Pro His Pro Thr Ser Ile Cys Asp Asn Phe Ser Ala Tyr Gly Trp
E--> 230 20 25 30
231 Cys Pro Leu Gly Pro Gln Cys Pro Gln Ser His Asp Ile Asp Pro Ile
E--> 232 35 40 45
233 Ile Asp Thr Asp Glu Ala Ala Ala Glu Asp Lys Arg Arg Arg Arg Arg
E--> 234 50 55 60
235 Arg Arg Glu Lys Arg Lys Arg Ala Leu Leu Asn Leu Pro Gly Thr Gln
E--> 236 65 70 75 80
237 Thr Ser Gly Glu Ala Lys Asp Gly Pro Pro Lys Lys Gln Val Cys Gly
E--> 238 85 90
239 Asp Ser Ile Lys Pro Glu Glu Thr Glu Gln Glu Val Ala Ala Asp Glu
E--> 240 100 105 110
241 Thr Arg Asn Leu Pro His Ser Lys Gln Gly Asn Lys Asn Asp Leu Glu
E--> 242 115 120 125
243 Met Gly Ile Lys Ala Ala Arg Pro Glu Ile Ala Asp Arg Ala Thr Ser
E--> 244 130 135 140
245 Glu Val Pro Gly Ser Gln Ala Ser Pro Asn Pro Val Pro Gly Gly Gly
E--> 246 145 150 155 160
247 Leu His Arg Ala Gly Phe Asp Ala Phe Met Thr Gly Tyr Val Met Ala
E--> 248 165 170 175
249 Tyr Val Glu Val Ser Gln Gly Pro Gln Pro Cys Ser Ser Gly Pro Trp
E--> 250 180 185 190
251 Leu Pro Glu Cys His Asn Lys Val Tyr Leu Ser Gly Lys Ala Val Pro
E--> 252 195 200 205
253 Leu Thr Val Ala Lys Ser Gln Phe Ser Arg Ser Ser Lys Ala His Asn
E--> 254 210 215 220
255 Gln Lys Met Lys Leu Thr Trp Gly Ser Ser
E--> 256 225 230

258 <210> SEQ ID NO: 17
259 <211> LENGTH: 725
260 <212> TYPE: DNA
261 <213> ORGANISM: Homo sapiens
263 <400> SEQUENCE: 17
E--> 264 gg@3^5J g ^innnij gggg gaactt^njtat cggtgcctac tcadfnjfcjaaaa ggctgaagag 60
265 tctcccatgt ctacttcttt ctacacagac acagcaacca tccgatttct caatcttttc 120
266 cccacctttc ccccttttct attccacaaa accgccattg tcatcatggg ccgttctcaa 180
267 tgagctgttg ggtgagatat tagaattcta ctcacagaac gaaatgaaaa gtctcccatg 240
268 tctacttctt ctacacaaga cacagcaaca tccgatttct caatcctttc cccaactttc 300

2 6 ^ccccttttct antccacaan-accgccattg tcatcatggg ncgttctcaa ^ga^ ^gtrbg^.aeQ^^^^^ ^ 

270 aatoaaatat taaaattctq qqctcrcrcraat qacrttcaqc c taataaaata taaaCCtqca-^420 *^ 
27 

^1 



27 
27 
27 
27 
27 



ggtga gatat Tagaattctg ggctcrqcraat craottcaqcc tggtgga atg tgaacctgc 
'i*cag€ttggc atgaacgggc aaatgctgtg tancctccgg aaaggagcgc ttcctggaacf^ 8 0 
ctggcgcctg actttgtggg ngacatcctc cgggaaaang gttcactant tctaaagcgg 
gcggcaacgc ggtggggctc caattcgccc taaantgngt ccgtattaca attcacnggg 
cggccgtttt anaagtcctg nncggggaaa accnggggt anccaacttt atcnccctgg 
nngaaanccc cccttncnca acnggggtna naacchahn n gggccncccn ntttgcccct 



cccaa 
278 <210> SEQ ID NO: 18

279 <211> LENGTH: 619

280 <212> TYPE: DNA

281 <213> ORGANISM: Homo sapiens 

283 <400> SEQUENCE: 18 

284 agaagcttga attcgagcag agaagcttga attcgagcag aattggccca attttgcctt 60 

285 ataccacttt ccaatacctt cacttggagt gacttacact gtggttaatt gcagttacaa 120 

286 tgaagagatt aacatgggaa tgtcataata attgaatcta aagaagacat aatttcaaaa 180 

287 taagagcttg agtaataata ccattgtgta acaatctgat ttccatccct cttatttttc 240 

288 ctatattatg cagtttagtt ctttactatc atgtgtttca tgtttgttcg gttttaccaa 300 

289 cacatcatta gtaaattgaa tgtaaggctt ctcatttctt ttgtatccta catctaaaag 360 

290 attttagtcc ttagaatcct cttgaaatgt tctccattta aaatggagaa atagttcatg 420 
E--> 291 ctctctcatc taagtangag ctaaaatcta aaaaattaat aaataaaata gtccatcctc 480
E--> 292 taataataat aatgaatact gaanttgtta antaataatt aatttttgag aagggggttc 540
E--> 293 actaatgcg tccaagctgg agtgcaatgg cgtgatcact aanttctaaa ncggcgccaa 600
E--> 294 cgcggtggag ctccaantn (111)




296 <210> SEQ ID NO: 19 

297 <211> LENGTH: 716 

298 <212> TYPE: DNA 

299 <213> ORGANISM: Homo sapiens 
301 <400> SEQUENCE: 19 

E--> 302 ggngtgggng nnnggggggg ggnntttnng gnncggntnt tctnaagtnt ccngggcctc
E--> 303 atnaaacagc gggccgagaa cgggncaana tgacaatggn ggttttgtgg aatagaaaag 

304 ggggaaaggt ggggaaatga ttgagaaatc ggatggttgc tgtgtctgtg tagaaagaag 

305 tagacatggg agacttttca ttttgttctg tgagtagaat tctgggctgg gaatgag
E--> 306 agcct^gtga atgtgaacct gcaccagttt ggcatgaacg gncagatgct gtgtaaccti
E--> 307 ggcaaggagc gcttcctgga gctggcgcct gactttgtgg gcgacatcct ctggnacag
E--> 308 ntccactagt tctagagcgg gcgccaccgc ggtggngctc caattcgccc tanagtgngt
E--> 309 cgtnttacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggngtta
E--> 310 cccaacttaa tcgccttgca gcanatcccc ctttcgncag ctggngtnnt ancgangagg
E--> 311 nccgcaccgn ttgcccntcc caanaagttg cgcagcctgn atggggantg ggancgncct
E--> 312 gtnncgggng cantaagcgc ggngggtgtg gtggntangc ncancgtgnn cgnnnnannt
E--> 31x^jnnagngcct tangccngnn ccttcgnttc tcccttcctt cnngnnangt ngcggg 716 

315 <210> SEQ ID NO: 20

316 <211> LENGTH: 619 

317 <212> TYPE: DNA 

318 <213> ORGANISM: Homo sapiens 

320 <400> SEQUENCE: 20 

321 agaagcttga attcgagcag agaagcttga attcgagcag aattggccca attttgcctt 60 

322 ataccacttt ccaatacctt cacttggagt gacttacact gtggttaatt gcagttacaa 120 
323 tgaagagatt aacatgggaa tgtcataata attgaatcta aagaagacat aatttcaaaa 180

324 taagagcttg agtaataata ccattgtgta acaatctgat ttccatccct cttatttttc 240 

325 ctatattatg c agtttaagt tctttactat catatatttc atxp^fefec^c^axilLtttacca 300 
E--> 326/^arra^caE : £ ^agtaaattqa atqtanggct tctcatttct tttgtatcct acatctaal Hr""3^]r 

327 g aLLLLdg tc tttaga atcc tcttgaaatg ttctcca ttt aaa atggaga aatagttcat^ 42 0 
E--> 328 gctctctcat ctaantanga gctaaaatct aaaaaataaa taaataaaat antccatcct 480
E--> 329 ctaataataa taatgaatac tgaanttgta aataataatt aatttttgag aatggggttc 540
E--> 330 actaatgtcg tccaanctgg agtgcaatgg cgtgatcact agttctaaac cggcgccaac 600
E--> 331 gcggtgggnc tccaattcc 619
356 <210> SEQ ID NO: 22
357 <211> LENGTH: 419
358 <212> TYPE: DNA
359 <213> ORGANISM: Homo sapiens
361 <400> SEQUENCE: 22
362 actt qaqtcc aqqaj
E--> 363 gacaga gtgagaccct gtctgttaaa aaataataat aataatagat aatgggatan 120
364 gagtg taaag aa agacag ga tg^ttrtt fi g raa^gt-taca aaaaatatta atangtcttt 180
365 a tPif rjtttqna t atg agrrhg a gaagaqaaaa tgaaaaaqtg aaaataagat 240
366 ttctcaaggt acaactttga tgcagttcan gtcaaactta ngtaagattt 1-gtt.qtanaa 300
367 tttgggaaat aaccattgtg gcaaggctgg aatgcaaatc gattttttgc tgttacagaa 360
368 acagtaaatg aatttatggg attttatttt aatttagtta gctttttatg aggagaatt 419

387 <210> SEQ ID NO: 24
388 <211> LENGTH: 584
389 <212> TYPE: DNA
390 <213> ORGANISM: Homo sapiens
392 <400> SEQUENCE: 24
393 agagcagtcc agtatatata catacatata caagctacaa gctgcatatg taatttaaaa 60
394 ttttctaata accacattta aaaaggtaaa aagaaactgt tgaaataaat tttaatatct 120
395 ttcattgaac ccaatatatg caaaatacta tcatttcaat tataaccaaa ttaaaattaa 180
396 ggagatattt tacaattttc atattaacgt ttccaattct ggtgtgaatt ttacactcac 240
397 cgaacatctc aattctgaca agtcatattt taagtgntna acagctacgt gaggatagtg 300
E--> 398 gctattatgt cacaaaatgc agctctangg atgaggacag tttacagaag atacttgagg 360
399 atacaggagc aagttaaatg gcagtttaag aaagcaaatc cangatgtgg gaaactccac 420
400 agaatanatg acctggtttc tcccttcact catccctcca aaatagaaat caatggcaga 480
401 aagaaaaaag anggaggctg ttgtancata aaatacttag ggacatacaa taaaaacagt 540
402 gtagggtttt gttgaanccg attcactaca atgattcaca antt 584

404 <210> SEQ ID NO: 25
405 <211> LENGTH: 678
406 <212> TYPE: DNA
407 <213> ORGANISM: Homo sapiens
409 <400> SEQUENCE: 25
E--> 410 ggggnnnntn tnnngnaaat ctctgngttc gggccccccc ancaaggtcg aggccta 60
E--> 411 ataagctana tatcggaattc ctgcagcccg ggggatctga tggttttata aaggggagt 120
412 gccctgcgaa agctctctct tacctgccgc catgtaagac cggactttgc tcctcattag 180
413 gtcaccctag ccatgtggaa ctgtgagtcc attaaacctc tttcctttat aaattatgca 240
414 gtctcggata tgtctttatt agcaaggtga aaatgaacta atacaagggt cacgtggtaa 300
415 atatatttaa tattaaaaaa aaatcttcca aactattttc cagagtgtct gtaccttttt 360
416 acatttccat gagcaacgta tgagtgattt agtttctttg acagcatttg gtatagttac 420
417 tattttttat tttagttgtt ctcatcctgg acttaatttg aattttccca atgatg 480
418 atgttgaaaa ttttcttgt gcttacttgt catctggata ttctcgtcaa taaaatgtct 540
419 cttantatcn tttgcccatt ttcaantgga ttccttttgt gttttatcat tgaattttaa 600
420 gaattcttcn atttatagat atgaattaca gatanaatca tagatattat agatanatat 660
421 gagttatggt tcacnatt 678

423 <210> SEQ ID NO: 26
424 <211> LENGTH: 509
425 <212> TYPE: PRT
426 <213> ORGANISM: Homo sapiens
428 <400> SEQUENCE: 26
429 Met Ala Ala Asp Ser Asp Asp Gly Ala Val Ser Ala Pro Ala Ala Ser
E--> 430 1 5 10 15
431 Asp Gly Gly Val Ser Lys Ser Thr Thr Ser Gly Glu Glu Leu Val Val
E--> 432 20 25 30
433 Gln Val Pro Val Val Asp Val Gln Ser Asn Asn Phe Lys Glu Met Trp
E--> 434 35 40 45
435 Pro Ser Leu Leu Ala Ile Lys Thr Ala Asn Phe Val Ala Val Asp Thr
E--> 436 50 55 60
437 Glu Leu Ser Gly Leu Gly Asp Arg Lys Ser Leu Leu Asn Gln Cys Ile
E--> 438 65 70 75 80
439 Glu Glu Arg Tyr Lys Ala Val Cys His Ala Ala Arg Thr Arg Ser Ile
E--> 440 85 90 95
441 Leu Ser Leu Gly Leu Ala Cys Phe Lys Arg Gln Pro Asp Lys Gly Glu
E--> 442 100 105 110
443 His Ser Tyr Leu Ala Gln Val Phe Asn Leu Thr Leu Leu Cys Met Glu
E--> 444 115 120 125
445 Glu Tyr Val Ile Glu Pro Lys Ser Val Gln Phe Leu Ile Gln His Gly
E--> 446 130 135 140
447 Phe Asn Phe Asn Gln Gln Tyr Ala Gln Gly Ile Pro Tyr His Lys Gly
E--> 448 145 150 155 160
449 Asn Asp Lys Gly Asp Glu Ser Gln Ser Gln Ser Val Arg Thr Leu Phe
E--> 450 165 170 175
451 Leu Glu Leu Ile Arg Ala Arg Arg Pro Leu Val Leu His Asn Gly Leu
E--> 452 180 185 190
453 Ile Asp Leu Val Phe Leu Tyr Gln Asn Phe Tyr Ala His Leu Pro Glu
E--> 454 195 200 205
455 Ser Leu Gly Thr Phe Thr Ala Asp Leu Cys Glu Met Phe Pro Ala Gly
E--> 456 210 215 220
457 Ile Tyr Asp Thr Lys Tyr Ala Ala Glu Phe His Ala Arg Phe Val Ala
E--> 458 225 230 235 240
459 Ser Tyr Leu Glu Tyr Ala Phe Arg Lys Cys Glu Arg Glu Asn Gly Lys
E--> 460 245 250 255
461 Gln Arg Ala Ala Gly Ser Pro His Leu Thr Leu Glu Phe Cys Asn Tyr
E--> 462 260 265 270
463 Pro Ser Ser Met Arg Asp His Ile Asp Tyr Arg Cys Cys Leu Pro Pro
E--> 464 275 280 285
465 Ala Thr His Arg Pro His Pro Thr Ser Ile Cys Asp Asn Phe Ser Ala
E--> 466 290 295 300
467 Tyr Gly Trp Cys Pro Leu Gly Pro Gln Cys Pro Gln Ser His Asp Ile
E--> 468 305 310 315 320
469 Asp Leu Ile Ile Asp Thr Asp Glu Ala Ala Ala Glu Asp Lys Arg Arg
E--> 470 325 330 335
471 Arg Arg Arg Arg Arg Glu Lys Arg Lys Arg Ala Leu Leu Asn Leu Pro
E--> 472 340 345 350
473 Gly Thr Gln Thr Ser Gly Glu Ala Lys Asp Gly Pro Pro Lys Lys Gln
E--> 474 355 360 365
475 Val Cys Gly Asp Ser Ile Lys Pro Glu Glu Thr Glu Gln Glu Val Ala
E--> 476 370 375 380
477 Ala Asp Glu Thr Arg Asn Leu Pro His Ser Lys Gln Gly Asn Lys Asn
E--> 478 385 390 395 400
479 Asp Leu Glu Met Gly Ile Lys Ala Ala Arg Pro Glu Ile Ala Asp Arg
480 405 410 415
481 Ala Thr Ser Glu Val Pro Gly Ser Gln Ala Ser Pro Asn Pro Val Pro
E--> 482 420 425 430
483 Gly Gly Gly Leu His Arg Ala Gly Phe Asp Ala Phe Met Thr Gly Tyr
E--> 484 435 440 445
485 Val Met Ala Tyr Val Glu Val Ser Gln Gly Pro Gln Pro Cys Ser Ser
E--> 486 450 455 460
487 Gly Pro Trp Leu Pro Glu Cys His Asn Lys Val Tyr Leu Ser Gly Lys
E--> 488 465 470 475 480
489 Ala Val Pro Leu Thr Val Ala Lys Ser Gln Phe Ser Arg Ser Ser Lys
E--> 490 485 490 495
491 Ala His Asn Gln Lys Met Lys Leu Thr Trp Gly Ser Ser
E--> 492 500 505

495 <210> SEQ ID NO: 27
496 <211> LENGTH: 3938
497 <212> TYPE: DNA
498 <213> ORGANISM: Homo sapiens
500 <400> SEQUENCE: 27
501 agcttatatt ctaatgggga cagaaaagga ataatgaaca taagtaaatt ccataagatg 60
502 ttaggtgata aatattagca taaaaagcaa aaattagacc aagaggggaa aaaaaagagt 120
503 gccaaggtgg ggtttaatgt tgcaatttta aagactgtgg tcaaggtaga cccaaagcat 180
504 tctaagtgag tgcaaaggcc ccaaggaggg tgcctggtat gtctgtggta cagtaagtag 240
505 gtcaatgtgg ttagaatgga atgagatggg actgagtggt agaagaggtc agagaagtaa 300
506 accagatgag gtggggagag gagggtcaca aagtacctta taggccattg gagggatttg 360
507 gctgccacac ccttgctctt agaaggcagt cctcttacta cagccttgca ggtccagtga 420
508 tccgggcacc atccgcctca tcccctcact atgctctagc caaggttgac tgaatttagt 480
509 tgcttaaaca cctcaagtgt gtctgcccac cttggggcct cacacaatcc atttcctctg 540
510 tttggactct tttatgcttt tacctaacac cttatcattt ttcaagtctt gactgaaatg 600
511 tccaaatcag gtcccctcat cttatcctat cacatatttc tgccttgtag ctcttaccta 660
512 atgtaatttt acattacttt gattctttcc atcagtgtgt acttcctgaa tttgactgta 720
513 aaaaacgact tgagtgcaag gactgattct cttgttgatt ggtgtgtgtc caaagtcagt 780
514 gccaggtaaa ctgtacacaa tagatacctg ttaaatgaat taatgggatg ggggatagtc 840
515 aaaagagttt ccctttttta ggataggaga aatccaaaga gtttttttat ttttgttttt 900
516 tttttgtttg tttgttttgt ttttagagac agtgtgtccc tcactttgct gctctgccac 960
517 tcaggctgga gtgcaataag aacatggctc actgcagcct cgacctcctg ggctcaagcc 1020
518 atcctctcac ctcagcctcc tgtagctggg actacaggtg cgcaccacca tgcccaacta 1080
519 atttttaatt ttctttttgt agagacaagg tttcactatg ttgcccaggc tagtcttgaa 1140
520 ctcctagggt caagcgatcc tcccaccttg gcctcctaag atgattacag gccataagcc 1200
521 actgcgcccg gcccaagcag ttctgaataa tgatgaaatg ggctcagttg agagaagctg 1260
522 aagattaact ataaacaatg agtaacaaag gagcactgga aggcagaggt ggatgggaat 1320
523 cgtagtgttt acggagggac tagtctccaa taggaatttt tttttttttt ttttttttga 1380
524 gacggagttt cgctcttgtt gcctaggctg aagtgcaaaa tggcgtgatc tcggctcacc 1440
525 gcaacctctg cctcccaggt tcaagcgatt ctcctgcctc agcctcccaa gtagtgggat 1500
526 tacaggcgcc cgcaccatac ccagctaatt ttttttgtac ttttagtaga gacggggttt 1560
527 caccatgttg gccaggctgg ttttgaactc cggacctcag gtaatccgcc cgcctcggcc 1620
528 tcccaaagtg ctgggattac aggcgtgagc caccgcgccc ggcctaggaa cctctttcaa 1680
529 attcaatcac cctctaggtc gactataccg cctagctgct tcacaatttg tcccttcctc 1740
530 gccatccata ctgccagcct taattcaagt tcacattatc acttgattgg attattacaa 1800
531 aagcttccct accaatcggt cgctcttaca ccctgggcag cctcctccga tggcccactc 1860
532 cccgcctctt tcactttctg gagatcactg agctctccat cctctctggg aatttaccga 1920
533 tgcccagaac gcccttcttt cccccacacg accctctcct agtctaactc ctgggcgtgc 1980
E--> 534 tttaagctca gctcaggca gcgtcacctt ctctggaaag cccaaaccca gccaccccaa 2040
E--> 535 tacccgctac ccgcggccca cgctgatgaa gacagcagaa cacggaggcc ccgcgttccc 2100
E--> 536 gccgcgagag caggagagaa agattacctc ccgcgagctc tagcgcgccc ggctttccgg 2160
E--> 537 cgcactccag ggggcgtggc tcgggtccac ccgggctgcg agccggcagc acaggccaat 2220
E--> 538 aggcaattag cgcgcgccag gctgccttcc ccgcgccgga cccgggacgt ctgaacggaa 2280
E--> 539 gttcgaccca tcggcgaccc gacggcgaga ccccgcccca tccccgactg cctgaaccgc 2340
E--> 540 gccaggagac ggaccgcaag tccagcgtac ccacagacga ctcaggcggg agacgagcgg 2400
E--> 541 tgtcatggcc gccgacagtg acgatggcgc agtttcagct cccgcagctt ccgacggtgg 2460
E--> 542 tgtcagcaaa agcacaacat ctgggagga gctagtagtc caggttcccg tagtggatgt 2520
E--> 543 gcaaagcaac aacttcaagg agatgtggc catccctcct gctagccata aagacagcta 2580
E--> 544 atttcgtggc tgtggacacg gagctgagtg ggcttgggga caggaagagt ttgctgaacc 2640
E--> 545 agtgcattga ggaacgttac aaggccgtgt gtcatgctgc caggacccgt tctatccttt 2700
E--> 546 ccctgggcct cgcctgcttc aagcggcagc cagacaaggg tgaacattcc tatctggctc 2760
E--> 547 aagtgttcaa tctcactctg ctgtgcatgg aggagtatgt catagaacca aagtctgtgc 2820
E--> 548 agttcctgat acagcatggc ttcaacttca accagcagta tgcccaaggc atcccctacc 2880
E--> 549 ataagggcaa tgacaagggt gatgagagcc agagccagtc agtacggacc ctattcctgg 2940
E--> 550 agctaatccg agcccgccgg cccctggtgc tacacaatgg ccttatagac ttggtgttcc 3000
E--> 551 tgtaccagaa cttctatgca cacctccctg agagtctggg aaccttcacc gctgacctgt 3060
E--> 552 gtgagatgtt cccagcaggc atttatgaca ccaaatatgc tgctgagttt catgcccgtt 3120
E--> 553 tcgtggcctc ctacttagaa tatgccttcc ggaaatgtga acgggaaaat gggaagcagc 3180
E--> 554 gggcagctgg cagcccacac cttaccctgg agttctgcaa ctatccttcc agcatgagg 3240
E--> 555 gaccatattg attaccgctg ctgcctgccc ccagcaaccc accgtcctca tcccaccagc 3300
E--> 556 atctgtgaca acttctcggc ttatggctgg tgccccctgg gaccacagtg tcctcagtct 3360
E--> 557 cacgatattg accttatcat tgacactgat gaggctgcgg cagaggacaa gcggcgacgg 3420
E--> 558 cgacgacgta gggaaaaacg gaagagggct ttattgaacct accggggaca cagacctctg 3480
E--> 559 gggaagctaa ggatggtcct cccaagaagc aggtctgtgg ggatagcatc aagcctgaag 3540
E--> 560 aaaccgagca ggaggtggct gccgatgaaa ctaggaacct gcctcactcc aagcaaggca 3600
E--> 561 acaaaaatga cttagagatg gggattaagg cagcaaggcc tgaaatagct gatagagcta 3660
E--> 562 cctcagaagt gccagggagc caagccagtc ctaacccagt gcctgggggt ggattgcacc 3720
E--> 563 gggctggttt tgatgccttt atgacaggtt atgtgatggc ctatgtggaa gtgagccagg 3780
E--> 564 gaccgcaacc ctgcagctct ggaccctggc tccctgaatg ccacaataag gtatatttga 3840
E--> 565 gtggcaaagc tgtacccctc acagtggcca agagccagtt ctctcgttcc tccaaagccc 3900
E--> 566 acaatcagaa gatgaagctc acttggggca gtagctga 3938
E--> 569 (1)
VERIFICATION SUMMARY DATE: 02/26/2002

PATENT APPLICATION: US/10/032,260 TIME: 11:05:43

Input Set : A:\EP.txt

Output Set: N:\CRF3\02262002\J032260.raw

L:12 M:270 C: Current Application Number differs, Replaced Application Number
L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:42 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:3
L:54 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:4
M:332 Repeated in SeqNo=4

L:66 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:5



L:152 M:283 W: Missing Blank Line separator, <400> field identifier
L:171 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:174 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:177 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:180 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:183 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:186 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:189 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:192 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:195 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:198 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:201 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:204 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:207 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:210 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15
L:213 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:15



L:228 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:16 
M:332 Repeated in SeqNo=16 

L:264 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:17
M:340 Repeated in SeqNo=17

L:274 M:254 E: No. of Bases conflict, LENGTH: Input: 660 Counted: 659 SEQ:17
M:254 Repeated in SeqNo=17

L:276 M:252 E: No. of Seq. differs, <211>LENGTH: Input: 725 Found: 724 SEQ:17
L:291 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:18
M:340 Repeated in SeqNo=18
L:293 M:254 E: No. of Bases conflict, LENGTH: Input: 600 Counted: 599 SEQ:18
M:254 Repeated in SeqNo=18
L:294 M:252 E: No. of Seq. differs, <211>LENGTH: Input: 619 Found: 618 SEQ:18



L:302 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:19
M:340 Repeated in SeqNo=19
L:326 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:20
M:340 Repeated in SeqNo=20
L:363 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:22
M:340 Repeated in SeqNo=22
L:398 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:24
M:340 Repeated in SeqNo=24
L:410 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:25
M:340 Repeated in SeqNo=25















L:411 M:254 E: No. of Bases conflict, LENGTH : Input : 120 Counted: 121 SEQ: 25 
M:254 Repeated in SeqNo=25 

L:430 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:26 
M:332 Repeated in SeqNo=26 
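
Message M:340 on this page is raised when a sequence uses n or Xaa but carries no <220> feature section explaining it. A rough pre-submission check along those lines can be sketched as follows, assuming the numeric-identifier layout shown in the raw sequence listing (<210>, <212>, <220>, <400>). The function name `sequences_missing_feature` is hypothetical, and this is an illustration, not a description of how the Checker program itself works.

```python
import re


def sequences_missing_feature(listing_text):
    """Return SEQ ID numbers that use n or Xaa but have no <220> entry.

    Illustrative helper only; assumes each record starts at <210> and
    the residues begin on the line after the <400> identifier.
    """
    missing = []
    # Split before every <210> so that each chunk is one sequence record.
    for record in re.split(r"(?=<210>)", listing_text):
        id_match = re.match(r"<210>\D*(\d+)", record)
        if id_match is None:
            continue  # general information ahead of the first sequence
        header, _, rest = record.partition("<400>")
        body = rest.partition("\n")[2]  # skip the remainder of the <400> line
        if re.search(r"<212>[^\n]*PRT", header):
            uses_unknown = re.search(r"\bXaa\b", body) is not None
        else:
            uses_unknown = "n" in body  # nucleotide lines are lower-case bases
        if uses_unknown and "<220>" not in header:
            missing.append(int(id_match.group(1)))
    return missing
```

For a listing like the one above, where SEQ ID NOS 17 through 25 contain n but no <220> section, a check of this kind would be expected to name those sequences; it does not verify that an existing <220>-<223> section actually explains each n or Xaa.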
VERIFICATION SUMMARY DATE: 02/26/2002

PATENT APPLICATION: US/10/032,260 TIME: 11:05:43

Input Set : A:\EP.txt

Output Set: N:\CRF3\02262002\J032260.raw

L:534 M:254 E: No. of Bases conflict, LENGTH: Input: 2040 Counted: 2039 SEQ:27
M:254 Repeated in SeqNo=27 

L:569 M:252 E: No. of Seq. differs, <211>LENGTH: Input : 3938 Found: 3935 SEQ:27 
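
Messages M:254 and M:252 above report that the running totals at the right margin, or the <211> length, disagree with the number of bases actually present. The sketch below shows one way such a comparison could be made for nucleotide lines. The function name `check_running_totals` and the format of the returned messages are assumptions for illustration, not the Checker program's own logic.

```python
import re


def check_running_totals(sequence_lines, declared_length=None):
    """Compare stated base counts with the bases actually present.

    Each line is expected to hold groups of bases followed by the running
    total so far, for example 'acgtacgtac gtacgtacgt 20'. Returns a list
    of (line_offset, message) pairs; offset 0 is the first sequence line.
    """
    problems = []
    counted = 0
    for offset, line in enumerate(sequence_lines):
        match = re.fullmatch(r"\s*([A-Za-z ]*?)\s*(\d+)\s*", line)
        if match is None:
            problems.append((offset, "line is not bases followed by a total"))
            continue
        counted += len(match.group(1).replace(" ", ""))
        stated = int(match.group(2))
        if stated != counted:
            problems.append((offset, f"Input: {stated} Counted: {counted}"))
    if declared_length is not None and declared_length != counted:
        problems.append(
            (None, f"<211> LENGTH: Input: {declared_length} Found: {counted}")
        )
    return problems
```

Once one line comes up short, every later running total in that sequence disagrees as well, which is consistent with the "Repeated in SeqNo" lines in the summary. Line 293 of SEQ ID NO: 18, whose first group (actaatgcg) has nine bases, is the kind of shortfall this comparison would surface.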